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ABSTRACT 


Previous  research  activity  at  The  George  Washington  University  has 
demonstrated  the  feasibility  of  designing  a  compiler  which  accepts  a  high  level 
(programming)  language  (HLL)  and  produces  microcode  for  a  computer  with  a 
horizontal  control  word  format.  On  going  research  activity  described  herein 
is  demonstrating  the  feasibility  of  designing  a  compiler  with  PASCAL  as  an 
input  HLL  that  can  generate  microcode  for  the  DEC  VAX  11/780  computer  which  is 
installed  in  the  BMDATC  Distributed  Data  Processing  (DDP)  Test  Bed.  The  effort 
described  has  as  its  goal  the  installation  of  this  compiler  at  the  Test  Bed  and 
to  interface  it  with  the  "User  Microprogramning"  support  software  provided  by 
DEC.  Additional  tasks  include  optimizing  the  performance  of  the  compiler  and 
conducting  a  study  of  the  interface  between  compiler  generated  microcode  and 
CPU  architectures. 
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INTRODUCTION 


The  use  of  microprogramming,  especially  for  high  usage  segments  of  computer 
programs,  as  a  way  to  enhance  hardware  performance  has  been  demonstrated  many 
times  (  1,2).  Since  the  advent  of  minicomputers,  circa  1970,  the  manufacturers 
of  these  systems  have  offered  their  customers  a  "user"  microprogramming  option. 
This  consists  of  a  small  writable  control  storage  (WCS)  and  micro  assemblers 
and  simulators  to  support  generation  of  microprograms.  There  hasn't  been  a  wide 
acceptance  of  this  "user"  microprogramming  option  by  the  purchasers  of  mini¬ 
computers  primarily  because  generation  of  microprograms  is  an  error  prone  and 
costly  task.  The  basic  support  offered  the  "user"  microprogrammer  has  simply 
been  inadequate  to  overcome  the  difficulties  of  preparing  and  using  microcode. 

The  purpose  of  the  research  described  in  this  report  is  to  develop  additional 
tools  to  support  the  "user"  microprogrammer.  The  use  of  high  level  languages 
(HLLs)  for  expressing  computer  programs  are  taken  for  granted  by  conventional 
software  programmers  and  the  use  of  this  technique  to  generate  microcode  appears 
promising.  Establishing  feasibility  that  HLL  compilers  can  be  developed  to  pro¬ 
duce  efficient  microcode  is  the  main  objective  of  this  research  activity.  Ef¬ 
ficient  microcode  in  this  context  means  microcoded  routines  which  significantly 
enhances  hardware  performance  on  critical  often  used  software  segments  hereafter 
referred  to  as  kernals. 

The  research  described  in  this  report  has  been  underway  for  about  two  years. 
In  the  beginning  two  compilers  were  available.  One  produced  microcode  from  the 
MPL  HLL  for  the  INTERDATA  Mod  3  and  the  other  generated  quadruples  from  the  PLM 
HLL  (3,4)  .  The  INTERDATA  Mod  3  was  designed  with  a  vertical  control  word  format 
consisting  of  16  bits  and  four  modes.  Generating  microcode  for  this  machine  is 
in  many  ways  similar  to  conventional  assembly  language  programming.  As  was  to  be 


expected,  the  compiler  designed  to  produce  INTERDATA  Mod  3  microcode  was  quite 
efficient.  The  big  challenge  was  to  produce  microcode  via  a  compiler  for  a 
computer  with  a  horizontal  control  word  format.  A  horizontal  control  word  may 
be  from  50  to  100  bits  wide  and  have  from  20-40  control  fields. 

The  first  research  effort  in  this  activity  consisted  of  using  the  PLM  to 
quadruple  compiler  to  generate  microcode  for  the  DEC  PDP  11/45  machine  (4)  . 

A  quadruple  to  11/45  microcode  translator  was  developed  and  performance 
measured.  The  results  were  encouraging  which  lead  to  a  much  more  challenging 
research  effort  (5)  to  change  both  the  input  to  the  compiler  and  to  generate 
microcode  for  the  DEC  VAX  11/780  machine  which  offered  a  "user"  microprogramming 
option.  The  input  to  the  compiler  was  changed  from  PLM  to  PASCAL  and  the  output 
again  went  via  the  intermediate  quadruple  format  directly  to  microcode  format 
of  the  VAX  machine  which  has  a  control  word  consisting  of  96  bits  with  nearly 
30  control  fields.  Unfortunately,  at  the  time  of  this  research  effort  DEC  hadn't 
released  the  "user"  microprogramming  support  manuals  and  this  greatly  hindered 
our  ability  to  determine  if  the  microcode  generated  was  correct  and  to  measure 
its  performance. 

At  the  inception  of  the  research  effort  of  this  present  contract  in  November 
1979,  it  was  decided  to  transfer  the  compilers  which  produce  microcode  for  the 
VAX  machine  to  be  operable  on  the  BMDATC  Test  Bed  VAX  systems.  A  companion 
ARO  contract  (6)  was  established  to  transport  the  PASCAL  to  VAX  microcode 
compiler  from  the  IBM  370  so  that  it  would  execute  on  the  VAX  system.  To  do 
this,  the  XCOM  compiler,  which  is  part  of  the  XPL  Translator  Writing  System, 
was  modified  to  produce  quadruples  as  an  output  instead  of  370  machine  language. 
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An  additional  program  was  written  to  generate  VAX  MACRO  language  statements  from 
the  quadruples.  This  research  activity  has  been  slowed  by  many  unusual  events 
including  a  two  month  shut  down  of  the  Computer  Center  at  GWU  along  with  the 
necessity  to  replace  the  programmers  working  on  the  compiler  twice.  As  a  result 
this  program  fell  behind  schedule  and  wasn't  able  to  support  some  aspects  of  this 
research  effort. 

Another  major  problem  was  the  long  delay  in  release  of  the  VAX  "user" 
microporgramming  support  documentation.  This  material  wasn't  received  until 
April  of  '80.  which.,  hampered  our  ability  to  verify  the  correctness  of  the 
compiled  VAX  microcode.  The  combined  effect  of  the  two  problem  areas  was  to 
lead  to  our  concentration  on  the  quadruple  to  VAX  microcode  portion  of  the 
compiler.  The  PASCAL  to  quadruple  compiler  developed  under  the  previous  research 
effort  was  essentially  assumed  to  work  "as  is"  and  the  performance  optimization 
and  measurement  activity  concentrated  on  the  quadruple  to  microcode  translator. 
Fortunately,  this  approach  worked  quite  well  and  good  estimated  performance 
measurements  were  obtained.  Confirmation  of  these  performance  measurements  on 
VAX  systems  at  the  BMDATC  Test  Bed  are  now  underway. 

There  are  four  objectives  for  this  research  activity.  The  first  was  to 
complete  the  transfer  of  the  PASCAL  to  VAX  11/780  microcode  compiler  to  be 
operational  on  the  BMDATC  Test  Bed.  Achievement  of  this  objective  was  contingent 
on  the  ARO  companion  effort  noted  above  which  has  fallen  behind  schedule.  Ac¬ 
cordingly,  the  accomplishment  of  this  objective  has  also  fallen  behind  schedule. 
Rapid  progress  is  being  made  and  an  operational  PASCAL  to  VAX  microcode  compiler 
should  be  available  at  the  BMDATC  Test  Bed  in  the  fall  of  '80.  The  second 
objective  was  to  measure  the  performance  of  the  PASCAL  to  VAX  microcode  compiler. 
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This  objective  has  been  largely  met  with  the  exception  of  confirming  estimated 
execution  times  with  actual  timing  runs  on  the  VAX  systems  at  the  BMDATC  Test 
Bed.  An  attempt  to  do  this  failed  because  of  systems  software  problems  within 
the  VAX  VMS  support  software.  The  third  objective  was  to  optimize  the  PASCAL  to 
VAX  microcode  compiler  performance.  Again  this  activity  was  concentrated  on  the 
quadruple  to  VAX  microcode  translator  and  good  results  were  obtained.  The  main 
point  here  was  to  transfer  data  requests  to  the  VAX  internal  registers  to  avoid 
the  overhead  of  main  storage  references.  The  fourth  objective  was  to  study  the 
interface  between  the  compiler  and  the  hardware  microcode.  This  was  done  in 
conjunction  with  objective  3  and  a  better  understanding  of  this  interface  was 
developed,  however,  much  remains  to  be  done  in  this  research  area. 

The  results  obtained  from  this  research  effort  are  mainly  to  demonstrate 
that  a  HLL  compiler  can  produce  efficient  microcode  for  the  VAX  machine.  Spe¬ 
cific  results  to  be  presented  later  indicate  that  compiler  produced  microcode  is 
nearly  comparable  to  microcode  produced  by  hand.  This  should  certainly  be  the 
case  for  the  "occasional "microprogrammer  who  wouldn't  develop  the  skill  level 
that  could  be  obtained  by  a  full-time  microprogrammer.  Secondly  the  optimization 
techniques  incorporated  in  the  quadruple  to  VAX  microcode  translator  work  quite 
well  and  appear  to  be  applicable  to  producing  microcode  for  a  wide  variety  of 
host  machine  hardwares.  Finally  a  better  understanding  of  the  compiler  to 
microcode  generation  process  will  lead  to  the  specification  of  host  machines 
which  are  better  suited  for  interfacing  to  high  level  languages  in  general. 

In  view  of  the  results  obtained  to  date  in  this  research  activity  to  produce 
compilers  from  HLL's  to  microcode,  it  can  be  concluded  the  feasibility  has  been 
established  and  progress  is  being  made  in  reduction-to-practice  of  this  technique. 


4 


Adding  this  tool  to  the  available  microprogramming  support  should  make  "user" 
microprogramming  a  much  more  worthwhile  option  for  purchasers  of  hardware 
with  this  feature.  With  the  availability  of  cheaper  hardware  and  the  ever 
increasing  cost  of  software,  the  use  of  microprogramming  to  enhance  system 
flexibility  appears  to  be  an  attractive  advantage..  This  research  activity  will 
lead  to  better  support  tools  for  microprogrammers  and  ultimately  to  a  wider 
use  of  this  technique  to  improve  both  hardware  performance  and  flexibility. 

The  final  report  includes  six  sections  and  four  appendices.  After  this 
introductory  section,  there  follows  a  section  on  research  objectives  and  one 
on  research  accomplishments.  A  fourth  section  deals  with  compiler  performance 
followed  by  a  section  on  the  HLL  compiler  hardware  interface.  The  report 
concludes  with  a  section  containing  summary  comments  on  the  significance  of 
this  research  and  suggestions  for  follow-on  activity. 


RESEARCH  OBJECTIVES 


Before  beginning  a  detailed  discussion  of  the  objectives  of  this  research 
effort,  a  few  preliminaries  are  in  order.  The  initial  activity  was  to  demon¬ 
strate  the  feasibility  of  producing  with  a  compiler  microcode  for  a  computer 
employing  a  horizontal  control  word  format.  This  was  demonstrated  (4)  using 
the  DEC  PDP  11/45  as  a  host  machine  which  has  a  56  bit  control  word  with  nearly 
20  control  fields.  This  research  revealed  the  importance  of  the  compiler  hard¬ 
ware  interface  and  especially  the  necessity  of  taking  advantage  of  available 
registers  in  the  host  machine  to  minimize  main  storage  references.  The  next 
effort  was  an  extension  of  the  feasibility  study  to  change  the  simple  PLM  source 
language  for  the  compiler  to  PASCAL  and  to  replace  the  DEC  11/45  by  the  VAX 
11/780  as  the  host  machine.  The  VAX  11/780  is  a  much  more  complex  host  machine 
than  the  11/45  and  utilizes  a  96  bit  control  word  format.  Finally  a  companion 
effort  was  initiated  to  transport  the  PASCAL  to  VAX  11/780  microcode  compiler  to 
the  VAX  systems  within  the  BMDATC  Test  Bed. 

The  first  objective  of  this  research  contract  was  intended  to  be  supportive 
of  the  other  three  and  as  a  follow-on  to  the  ARO  contract  objectives.  By  trans¬ 
ferring  the  PASCAL  to  VAX  microcode  compiler  to  be  operational  at  the  BMDATC 
Test  Bed,  it  was  hoped  that  other  BMDATC  contractors  would  be  encouraged  to  take 
advantage  of  the  "user"  microprogramming  capability  of  the  VAX  hardware  to  en¬ 
hance  their  system  performance.  To  support  this  goal  it  was  necessary  to  make 
the  PASCAL  to  VAX  microcode  compiler  available  on  the  BMDATC  Test  Bed  as  well 
as  provide  support  documentation  as  to  how  these  facilities  are  to  be  used.  As 
an  initial  step  in  this  direction  the  PASCAL  compiler  had  to  be  well  documented 
and  a  "users"  manual  is  required.  In  addition,  the  compiler  must  be  made  resident 
on  the  VAX  systems  at  the  BMDATC  Test  Bed  and  local  systems  programming  personnel 
had  to  be  familiarized  with  how  to  use  this  facility. 
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The  second  objective  was  to  evaluate  the  performance  of  the  PASCAL  to  VAX 
11/780  microcode  compiler.  This  would  proceed  through  several  phases  starting 
with  the  relatively  unoptimized  version  available  at  the  inception  of  this  re¬ 
search  activity  and  moving  on  to  the  optimized  versions  of  the  compiler.  The 
performance  of  compiler  generated  microcode  would  be  measured  against  hand 
generated  microcode  and  also  against  versions  of  the  test  programs  coded  in 
other  high  level  languages  available  on  the  VAX  system,  i.e.  mainly  FORTRAN  IV. 
Terms  of  measurement  would  be  in  running  time  and  where  possible  in  terms  of 
control  storage  space  utilization. 

Objective  three  would  be  corollary  with  objective  two  in  that  optimization 
of  the  compiler  would  be  carried  out  to  provide  better  performance  against 
hand  coded  microprograms  and  also  in  terms  of  running  time  against  versions  of 
the  test  programs  coded  in  other  HLL's  resident  on  the  VAX  system.  This  ob¬ 
jective  has  two  aspects,  namely  optimization  of  the  PASCAL  to  quadruple  compiler 
and  optimization  of  translation  of  quadruples  to  microcode.  The  latter  aspect 
proved  to  be  especially  significant  in  generation  of  microcode  for  the  PDP  11/45. 
Fairly  standard  optimization  techniques  will  be  considered  in  achieving  this  ob¬ 
jective  for  the  PASCAL  to  quadruple  compiler.  Register  allocation  algorithmns 
will  be  used  to  the  maximum  extent  possible  to  optimize  the  quadruple  to  VAX 
microcode  translator  to  improve  run  time  performance  and  minimize  the  amount  of 
control  store  usage. 

The  fourth  objective  is  to  study  the  interface  between  the  PASCAL  to  VAX 
11/780  microcode  compiler  and  the  CPU  architecture.  While  this  interface 
physically  is  at  the  microcode  level,  factors  such  as  internal  register  avail¬ 
ability,  HLL  features,  and  the  choice  of  an  intermediate  representation  all  play 


7 


a  role  in  determining  total  system  performance.  The  experience  gained  in  gen¬ 
erating  microcode  for  the  POP  11/45  from  the  PLM  HLL  demonstrated  the  importance 
of  this  objective.  In  generating  test  cases,  it  was  discovered  that  for  all  but 
one  of  the  test  cases,  Fibonacci  Series,  it  wasn't  possible  to  efficiently  hand 
microprogram  them  because  of  the  shortage  of  internal  registers  in  the  PDP  11/45 
CPU.  From  the  extent  to  which  the  goals  of  this  objective  are  reached,  we  should 
be  better  able  to  determine  architectural  specifications  of  host  machines  to 
enhance  their  microprogranmabil ity  and  ultimately  their  performance  in  a  wide 
range  of  applications. 

The  four  objectives  for  this  research  effort  which  are  described  above  span 
a  wide  range  of  issues  in  the  design  of  compilers  to  produce  microcode.  As  in 
any  research  effort  all  objectives  aren't  equally  achieved  and  as  activity  pro¬ 
gresses  new  directions  may  appear  and  are  pursued.  Another  factor  is  the  research 
environment  which  can  often  lead  to  decisions  to  pursue  different  approaches  to 
avoid  obstacles  that  impede  progress  in  the  initially  choosen  directions.  This 
contract  activity  encountered  many  problems  and  directions  were  changed  to  maxi¬ 
mize  the  accomplishments  as  will  be  described  in  the  next  section. 


PROJECT  ACCOMPLISHMENTS 


A  number  of  accomplishments  have  been  achieved  as  a  result  of  the  research 
activity  into  the  design  of  compilers  to  convert  computational  algorithms  ex¬ 
pressed  in  a  HLL  into  microcode.  Some  of  the  high  lights  include  a  high  degree 
of  optimization  of  the  efficiency  and  performance. of  the  microcode  produced  by 
the  compiler  which  is  executable  on  the  VAX  11//80  CPU.  Another  significant 
accomplishment  is  the  greater  understanding  of  the  interface  between  compilers 
which  generate  microcode  and  CPU  architectures.  As  design  techniques  for  micro¬ 
programmed  CPU's  are  improved,  the  interface  between  the  software  and  firmware 
and  these  architectures  becomes  a  more  critical  technical  consideration.  While 
many  of  the  research  objectives  were  met,  not  all  were  and  in  some  cases  a  re¬ 
direction  was  required  to  compensate  for  circumstances  arising  during  the  research 
activity. 

Two  events  beyond  our  control  contributed  to  the  degree  of  accomplishment 
of  the  research  objectives.  The  first  was  the  delay  until  April  '80  of  the 
receipt  of  the  complete  DEC  support  documentation  on  how  to  microprogram  the  VAX 
CPU.  Some  preliminary  and  partial  information  became  available  in  January  '80 
but  lacking  the  complete  documentation  adversely  affected  the  quadruple  to  micro¬ 
code  translator  development.  The  second  problem  was  the  shut  down  of  the  computer 
facilities  at  GWU  which  were  required  to  support  this  research.  This  led  to  a 
late  start  in  achieving  objective  1  to  get  the  PASCAL  compiler  operational  on 
the  VAX  System.  A  third  problem,  characteristic  of  a  graduate  school  research 
environment,  is  the  high  rate  of  turnover  of  research  staff.  This  research  pro¬ 
ject  was  especially  hard  hit  by  the  necessity  of  replacing  two  lead  compiler 
programmers  along  with  the  unavoidable  training  period  for  replacements. 


The  accomplishments  for  each  objective  will  be  discussed  below.  It  should 
be  emphasized  that  this  is  an  ongoing  research  effort  and  objectives  not  entirely 
achieved  at  the  time  this  report  is  being  written  will  be  accomplished  in  the 
near  future.  The  status  of  accomplishment  for  each  objective  will  include  an 
estimated  completion  date  where  appropriate. 

Progress  on  objective  one  to  make  the  PASCAL  to  VAX  11/780  microcode  com¬ 
piler  operable  on  the  BMDATC  Test  Bed  is  behind  schedule  by  about  two  to  three 
months  for  the  reasons  noted  above.  Work  on  interfacing  this  compiler  with  the 
DEC  "User"  Microprogramming  support  documentation  is  underway.  A  preliminary 
user's  microprogramming  manual  has  been  prepared  and  is  shown  at  appendix  B. 

Of  particular  importance  is  the  design  of  invoking  programs  which  provide  an 
interface  between  the  "user"  microprogram  and  the  VMS  operating  system.  An 
overview  block  diagram  of  the  invoking  procedure  is  shown  in  figure  1  and  a 
sample  invoking  program  is  shown  in  figure  2.  Another  example  of  "user"  docu¬ 
mentation  intended  for  operational  personnel  at  the  BMDATC  Test  Bed  is  shown  at 
appendix  A.  It  is  a  documented  program  written  in  the  XPL  language  which  converts 
quadruples  to  the  VAX  system  MACRO  language  along  with  a  variation  of  this  translator 
which  is  still  in  development  produces  microcode  for  the  VAX  11/780  from  quad¬ 
ruples.  In  summary  the  status  of  objective  one  is  that  it  is  only  partially 
complete  and  behind  schedule.  While  some  of  the  check  out  of  the  compiler  can  be 
carried  out  on  the  VAX  11/780  at  GWU,  a  major  effort  will  also  be  required  at  the 
Test  Bed  to  bring  it  into  operational  status  there. 

Objective  two  is  to  evaluate  the  performance  of  the  PASCAL  to  VAX  11/780 
microcode  compiler  in  terms  of  operational  performance  and  microcode  storage 
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Figure  2  Microcode  Invoking  Program  in  VAX  11/780  MACRO  Language 


requirements.  After  getting  off  to  a  slow  start  due  to  lack  of  availability  of  the 
DEC  support  documentation  for  "user"  microprogramming,  considerable  progress  has 
been  made  in  evaluating  the  compiler  performance  which  looks  quite  acceptable. 

One  missing  detail  for  this  objective  is  to  conduct  timing  runs  of  the  microcode 
produced  both  by  hand  and  by  compiler  in  the  VAX  systems  installed  at  the  Test  Bed. 
An  attempt  was  made  to  do  this  but  failed  due  to  problems  with  the  VAX  system  soft¬ 
ware  supporting  the  timing  procedure.  The  test  approach  adopted  for  this  objective 
was  to  initially  code  the  four  test  programs  in  four  ways.  These  are  to  code  the 
test  program  in:  DEC  FORTRAN  IV,  PASCAL,  microcode  derived  from  quadruples,  and 
hand  coded  microcode.  Consideration  was  given  to  also  coding  the  test  programs  in 
the  VAX  MACRO  language  but  this  was  dropped  when  it  became  apparent  that  there 
would  be  little  improvement  in  this  representation  over  the  MACRO  version  generated 
by  the  FORTRAN  IV  compiler.  Timing  tests  were  made  on  the  FORTRAN  IV  version  of 
the  test  programs  and  estimated  execution  times  were  generated  for  the  quadruple 
and  hand  coded  microcode  versions.  It  is  these  latter  versions  that  we  were  unable 
to  get  actual  running  times  on  the  Test  Bed  VAX  systems.  Because  of  the  importance 
of  the  accomplishments  for  objective  2,  it  will  be  covered  in  more  detail  in 
section  four  dealing  with  the  compiler  performance. 

Accomplishments  for  objective  three  are  corollary  to  those  for  objective  2. 
Optimization  of  the  PASCAL  to  VAX  11/780  microcode  compiler  is  decomposed  into 
two  parts.  The  first  part  deals  with  the  PASCAL  to  quadruple  compiler.  Because 
of  the  problems  noted  above  for  objective  one,  it  wasn't  possible  to  make  an 
attempt  to  significantly  Improve  this  compiler  performance  in  a  general  way.  It 
is  also  our  observation  from  earlier  research  experience  that  significant  gains 


in  microcode  operational  performance  are  unlikely  to  be  due  to  the  design  of  the 
compiler.  Instead  in  accomplishing  objective  three  all  the  stress  was  placed  on 
optimizing  the  performance  of  the  quadruple  to  VAX  11/780  microcode  translator. 
Here  significant  gains  in  performance  were  obtained  by  taking  advantage  of  alloca¬ 
ting  data  to  registers  and  combining  quadruples  into  one  microinstruction  were 
possible.  This  required  definition  of  special  microcoded  MACROS  which  is  the 
interface  to  the  "user"  microprogramer  supported  by  the  DEC  microprogramming 
documentation.  While  all  aspects  of  optimizing  the  generation  of  microcode  from 
quadruples  haven't  been  investigated,  the  performance  of  the  microcode  generated 
from  quadruples  compared  to  hand  coded  microcode  looks  very  good.  Again  because 
of  the  significance  of  our  accomplishments  for  this  objective  they  will  be  dis- 
dussed  in  more  detail  in  section  four  below. 

The  study  of  the  HLL  compiler  CPU  interface  is  an  out  growth  of  earlier  ob¬ 
servations  of  the  critical  nature  of  this  boundary  between  software,  firmware  and 
hardware.  Even  more  interesting  is  the  trend,  especially  apparent  in  the  VAX 
machine,  for  CPU  architectures  to  directly  reflect  HLL  representations.  Examples 
include  the  support  of  procedures,  subroutine  calls,  stacks,  and  a  host  of  other 
HLL  features  at  the  machine  language  level.  This  leads  to  direct  interpretation 
of  these  HLL  features  at  the  microcode  level  and  eliminates  the  necessity  of 
encoding  these  features  in  conventional  machine  language,  e.g.  the  DEC  PDP  II 
assembly  language.  This  introduction  of  HLL  concepts  directly  into  the  machine 
language  of  the  CPU  tends  to  diminish  the  advantages  of  microprogramming  computa¬ 
tional  algorithms  to  gain  run  time  performance.  As  will  be  noted  below  this  is 
born  out  by  our  performance  evaluation  showing  only  about  3  to  4  to  one  advantage 
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of  hand  microcoded  algorithms  over  the  FORTRAN  IV  equivalent.  This  shows  the 
significance  of  this  trend  in  CPU  architecture  design.  While  performance  ad¬ 
vantages  may  not  be  so  great  in  using  directly  microprogrammed  kernals  in  the 
VAX  system,  the  flexibility  of  microprogramming  can  still  offer  lots  of  advan¬ 
tages.  To  investigate  this  further  requires  selection  of  algorithms  which  will 
achieve  maximum  benefit  from  microprogranming  and  evaluate  the  effectiveness  of 
this  representation  as  opposed  to  representations  generated  at  the  FORTRAN  IV 
or  MACRO  language  level.  Again  because  of  the  importance  of  this  objective,  it 
is  discussed  in  more  detail  in  section  five. 

Considerable  progress  has  been  made  on  the  objectives  originally  stated  for 
this  research  effort  in  spite  of  the  many  obstacles  encountered.  Much  remains  to 
be  done  and  the  proposed  on-going  research  addresses  many  of  these  issues.  Most 
important,  a  good  foundation  is  being  established  utilizing  "state-of-the-art" 
hardware  to  develop  tools  to  assist  in  the  generation  of  microcode.  With  some 
estimates  of  the  cost  of  generating  a  line  of  microcode  running  as  high  as  $1000.00, 
any  improvement  in  the  productivity  of  microprogrammers  can  bring  about  signi¬ 
ficant  savings.  In  spite  of  the  trend  of  CPU  manufacturers  to  incorporate  HLL 
features  directly  in  hardware,  this  flexibility  of  being  able  to  directly 
microprogram  critical  software  kernals  makes  the  development  of  tools  to  assist 
the  microprogrammer  a  high  priority  effort.  As  more  is  learned  about  the  com¬ 
pilation  procedures  for  microcode,  better  advantage  can  be  taken  of  available 
hardware  performance.  It  is  toward  this  objective  that  the  long  range  goals  of 
this  research  effort  are  directed. 
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COMPILER  PERFORMANCE 


HLL  Compiler  performance  can  be  measured  in  many  ways.  These  include: 
impact  on  programmer  productivity,  readability  and  documentation  of  source  code, 
computational  thruput  of  the  object  code  produced  by  the  compiler,  and  amount  of 
storage  space  required  for  a  compiled  program.  From  these  fundamental  measures 
other  criteria  can  be  derived  including  compiler  efficiency  (7)  which  is  a 
comparison  of  computational  thruput  and  storage  requirements  of  an  algorithm  coded 
in  a  HLL  and  assembly  language.  To  optimize  the  performance  of  a  compiler,  it  is 
common  practice  to  try  to  reduce  the  computational  time,  commonly  referred  to 
as  "run  time"  of  the  generated  object  code.  Optimization  of  the  run  time  of  the 
object  code  often  requires  additions  to  the  time  it  takes  to  compile  a  program. 

If  compile  time  also  is  an  important  factor,  then  tradeoffs  may  be  required  between 
object  code  optimization  and  compiler  run  time. 

In  this  research  project  microcode  execution  time  was  the  goal  of  the  op¬ 
timization  effort.  Compile  time  wasn't  considered  nor  were  control  storage  re¬ 
quirements.  As  noted  before,  optimization  activity  was  directed  at  the  quadruple 
to  microcode  translator.  The  main  factors  to  be  considered  were  the  allocation  of 
registers  to  the  variables  being  modified  at  any  particular  segment  of  the  program, 
enablement  of  branch  operations,  and  accessing  data  arrays.  By  careful  design  of 
the  translator  program,  we  were  able  to  generate  microcode  which  was  comparable  in 
run  time  performance  to  hand  generated  microcode. 

To  test  the  compiler  two  approaches  were  used.  The  first  was  to  make  timing 
estimates  by  counting  the  number  of  microinstructions  that  would  be  executed  to 
carry  out  the  computations  required  for  the  test  cases.  The  execution  time  of  each 
microinstruction,  which  is  typically  2Q0  nonoseconds,  were  used  as  factors  to 
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estimate  test  program  run  times.  The  second  approach  is  to  create  a  microprogram 
load  module  and  actually  execute  it  on  the  VAX  systems  in  the  BMDATC  Test  Bed. 
Because  of  the  granularity  (10  milliseconds)  of  the  elapsed  time  measurement 
facility  on  the  VAX  system,  it  is  necessary  to  run  the  test  case  many  times  in 
order  to  be  able  to  make  a  run  time  estimate.  The  control  of  the  number  of  test 
cycles  to  be  run  must  be  included  within  the  microcode  and  this  introduces  a 
bias  in  the  run  time  estimate.  There  is  an  additional  timing  overhead  intro¬ 
duced  by  the  invoking  program  as  it  passes  control  to  the  microprogram.  This  is 
a  one  time  bias  which  doesn't  seriously  affect  the  accuracy  of  the  run  time  tests. 
In  order  to  reduce  the  impact  of  these  test  errors,  the  number  of  test  run 
cycles  is  varied  to  obtain  an  average  run  time  for  each  test  case. 

To  illustrate  the  compilation  procedure  an  example  is  shown  in  figure  3. 

This  shows  a  sort  routine  which  sorts  an  array  of  numbers  into  ascending  order. 

This  program  has  a  best  and  worst  case  test  requirement.  The  best  case  is  when 
all  the  elements  to  be  sorted  are  already  in  ascending  order  and  the  worst  case 
occurs  when  they  are  initially  all  in  descending  order.  It  is  evident  from  this 
figure  as  to  where  some  overhead  is  inherent  in  microcode  generated  by  a  compiler. 
Do  loops  require  indices  to  be  specified  and  comparisons  must  be  made  on  iteration 
variables.  For  branches  and  if-then-else  constructs  it  is  necessary  to  compute 
a  predicate  before  determining  whether  to  go  to  the  branch  address  or  not.  In  spite 
of  these  overhead  aspects  of  compiled  microcode,  the  run  time  performance  of 
microcode  generated  from  quadruples  is  nearly  comparable  to  hand  generated  micro¬ 
code  as  is  shown  below. 

The  quadruple  to  microcode  translator  was  designed  to  produce  as  efficient 
microcode  as  possible.  The  design  approach  was  to  first  microcode  the  test  pro¬ 
blems  by  hand  and  compare  it  to  the  microcode  derived  by  translation  from  quad- 
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ruples.  The  differences  were  examined  and  the  translator  was  modified  to  reduce 
this  difference  to  the  maximum  extent  possible.  In  some  cases  this  involves 
changing  the  quadruple  input  from  the  PASCAL  to  quadruple  compiler.  Changes  are 
being  made  to  the  compiler  to  produce  the  desired  quadruples. 

A  final  effort,  which  is  currently  in  process,  is  to  redefine  the  MICRO  2 
MACROS  to  better  reflect  the  translation  of  quadruples  into  microcode.  The 
MICRO  2  MACROS  are  a  set  of  micro  assembly  language  routines  which  are  used  by 
OEC  Engineers  in  the  design  of  the  microprograms  to  interpret  the  VAX  machine 
instructions  set.  Several  instances  have  been  detected  where  new  MICRO  2  MACRO 
definitions  would  offer  still  better  hardware  performance  for  microcode  generated 
from  quadruples. 

A  major  consideration  in  the  design  of  both  the  PASCAL  to  quadruple  compiler 
and  quadruple  to  microcode  translator  are  precise  definitions  of  the  quadruple 
formats  and  specifications  in  terms  of  MrCRO  2  MACROS  of  each  quadruple  type. 

The  quadruple  consists  of  an  operation,  two  source  operands,  and  a  destination 
operand.  Each  operand  can  be  expressed  in  terms  of  different  data  types.  This 
gives  an  extremely  wide  range  of  quadruple  formats  and  data  definitions.  Appendix 
C  lists  the  basic  quadruple  formats  along  with  a  representative  set  of  quadruple 
types.  Note  that  for  each  operator  there  are  a  number  of  potential  operand  data 
types  leading  to  a  wide  variety  of  quadruples. 

Appendix  0  contains  a  table  showing  the  MICRO  2  MACROS  required  to  implement 
the  various  quadruple  types.  Again  a  given  quadruple  operator  may  have  a  number 
of  MICRO  2  MACRO  representations  corresponding  to  various  operand  data 
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tyDes.  Finally  in  appendix  E  a  couple  representative  microcoded  test  cases  are 
shown.  Some  of  these  as  indicated  are  derived  from  quadruple  representations  and 
some  are  representative  of  hand  coded  microcode. 

The  data  obtained  for  the  estimated  run  times  is  shown  in  table  I.  In 
general  the  microcode  produced  by  the  PASCAL  compiler  had  a  run  time  which 
averaged  45%  longer  than  the  run  time  of  the  equivalent  hand  generated  micro¬ 
code.  The  same  test  problems  coded  in  DEC  FORTRAN  IV  and  actually  timed  on 
the  VAX  system  had  an  average  run  time  which  was  312%  longer  than  the  hand 
generated  microcode.  The  average  speed  advantage  of  the  PASCAL  to  microcode 
compiler  over  the  FORTRAN  compiler  is  2.22  to  one.  This  result  is  a  little 
surprising  since  in  general  going  directly  to  microcode  should  produce  a  bigger 
speed  advantage.  Part  of  the  reason  is  the  inclusion  of  high  level  language 
instructions  in  the  VAX  11/780  instruction  set  which  facilitates  the  direct 
interpretation  of  a  HLL  on  the  VAX  system.  This  issue  will  be  more  extensively 
discussed  in  the  next  section. 

From  the  data  in  table  I,  we  can  conclude  that  the  PASCAL  to  microcode 
compiler  is  an  effective  software  development  tool.  It  produces  quite  efficient 
microcode  which  is  nearly  as  efficient  as  the  computational  performance  of  hand 
generated  microcode.  The  optimization  effort  on  the  quadruple  to  microcode 
translator  was  quite  successful  as  is  evidenced  by  the  efficiency  of  the  micro¬ 
code  produced.  Further  effort  should  be  expended  on  the  quadruple  to  microcode 
translator  to  see  if  its  performance  can  be  improved  still  further. 
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HIGH  LEVEL  LANGUAGE-HARDWARE  INTERFACE 


Interest  in  the  interface  between  the  HLL  compiler  and  the  CPU  hardware 
developed  during  the  first  phase  of  this  research  activity  (4)  when  it  was 
demonstrated  that  microcode  could  be  generated  for  CPU's  with  a  horizontal  con¬ 
trol  word  format.  Because  of  the  inherent  machine  independence  of  most  HLLs, 
it  was  necessary  to  introduce  some  intermediate  representation  which  was  also 
relatively  machine  independent  but  could  be  translated  into  efficient  microcode. 
Quadruples  were  selected  for  this  representation  but  other  choices  could 

be  made,  e.g.  triples,  polish  notation,  or  tree  structures.  Many  of  these 
alternate  choices  imply  the  use  of  a  push  down  stack  to  implement  the  trans¬ 
lation  into  microcode. 

The  selection  of  quadruples  as  an  intermediate  representation  was  largely 
motivated  by  the  availability  of  a  compiler  which  produced  quadruples  from  the 
PLM  language.  While  quadruples  are  relatively  machine  independent,  in  many  ways 
they  are  equivalent  to  machine  language.  Quadruples  are  directly  interpreted  in 
microcode  just  as  machine  instructions  are.  The  basic  difference  is  that  machine 
instructions  deal  with  specific  entities  within  the  CPU,  e.g.  registers,  constant 
generators,  shifters,  and  data  formats.  Quadruples  on  the  other  hand  specify  only 
operations  and  operand  types  and  addresses.  These  must  first  be  translated  into 
a  format  more  representative  of  the  CPU  architecture  before  being  converted  into 
microcode.  The  translation  from  quadruples  to  a  similar  but  machine  dependent 
format  is  the  basic  task  of  the  quadruple  to  microcode  translator.  Interpretation 
of  the  machine  dependent  quadruple  representation  is  relatively  straight  forward. 

A  significant  difference  between  the  VAX  11/780  and  the  POP  11/45  CPUs  is 
the  Inclusion  of  HLL  constructs  in  the  machine  language  of  the  VAX  system.  As 
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noted  above  this  significantly  improves  the  VAX  11/780  system  performance  in 
executing  algorithms  expressed  in  a  HLL  which  can  be  translated  directly  into 
these  machine  language  expressions.  Another  feature  of  the  VAX  11/780  machine 
instruction  set  is  its  rich  variety  of  operand  formats.  This  permits  literally 
thousands  of  different  instructions  to  be  generated  from  only  a  couple  hundred 
basic  types.  On  the  one  hand  this  presents  a  serious  problem  to  the  compiler 
designer  who  must  attempt  to  use  all  the  available  CPU  facilities.  The  price 
paid  for  all  this  flexibility  is  longer  run  time  and  storage  space  being  required 
for  the  compilation  procedure.  On  the  other  hand  the  run  time  performance  of 
compiled  machine  code  begins  to  approach  that  of  hand  generated  microcode. 

Tradeoffs  must  be  made  by  the  CPU  designers  to  provide  features  which  makes 
the  hardware  perform  efficiently  when  most  of  the  software  to  be  executed  is 
originally  expressed  in  a  HLL.  One  danger,  of  course,  is  to  create  a  machine 
Instruction  set  which  is  too  biased  towards  a  particular  HLL.  This  doesn't  appear 
to  be  the  case  for  the  VAX  11/780  but  this  will  be  better  known  when  performance 
data  becomes  available  for  HLL  compilers  for  languages  other  than  FORTRAN  IV  i.e., 

PASCAL,  ADA,  and  C  language  which  produce  object  code  for  the  VAX  system  become 
available. 

Of  particular  significance  to  the  generation  of  efficient  microcode  from 
quadruples  is  the  availability  of  registers  to  the  microprogrammer.  The  VAX  11/780 
has  seven  general  purpose  registers  available  at  both  the  machine  instruction  and 
microprogram  level.  In  addition  there  are  sixteen  scratch  pad  registers  available 
to  the  microprogrammer.  This  is  a  vast  improvement  over  the  registers  available 
in  the  POP  11/45  where  there  were  only  six  general  purpose  registers  available 
both  to  the  machine  language  programmer  and  microprogrammer.  Lack  of  registers 
made  array  access  very  difficult  and  inefficient  in  the  POP  11/45  and  prevented 
efficient  hand  microcoding  of  most  of  the  test  cases  used  in  this  research  effort. 
Array  manipulations  can  be  easily  handled  within  the  VAX  CPU  and  this  leads  to 
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significant  increases  in  run  time  performance  of  algorithms  requiring  array 
accesses . 

For  a  given  set  of  registers,  the  allocation  of  data  to  registers  is  the 
critical  factor  in  achieving  optimization  of  translating  quadruples  into  micro¬ 
code.  The  procedure  is  to  scan  through  the  quadruples  and  as  operands  involving 
data  or  addresses  are  identified,  an  assignment  is  made  to  an  available  register. 
If  no  register  is  available,  then  a  deallocation  algorithm  must  decide  which  data 
item  should  be  removed  from  the  registers.  These  functions  are  all  carried  out 
at  compile  time  and  don't  impact  the  run  time  performance  of  the  microprogram 
load  module  except  to  the  extent  that  data  allocation  to  registers  requires 
shifting  data  back  and  forth  between  main  storage  and  the  CPU  registers. 

Another  key  factor  in  generating  microcode  is  the  available  data  paths  within 
the  CPU.  Any  operation  on  data  for  a  particular  microinstruction  requires  that 
the  data  flow  within  the  CPU  is  on  separate  data  busses.  Determining  whether  this 
is  the  case  is  a  very  tedious  and  error  prone  procedure.  Fortunately  the  MICRO  2 
microcode  assembler  does  a  good  job  of  detecting  data  path  usage  conflicts  and 
provides  error  messages  to  the  microprogrammer.  The  tendency  in  the  design  of 
early  minicomputers  was  to  minimize  the  number  of  available  data  paths  which  had 
a  severe  impact  on  CPU  performance.  The  VAX  11/780  CPU  includes  enough  data  paths 
to  mostly  avoid  having  to  generate  additional  microinstructions  to  resolve  data 
path  conflicts.  Some  improvements  could  be  made  in  the  VAX  data  path  structure 
but  in  qeneral,  the  available  data  paths  are  more  than  adequate  to  support 
the  other  internal  processing  functions  in  the  CPU. 
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In  sunmary  then  the  main  factors  in  the  compiler  to  CPU  interface  are: 
available  machine  instructions  which  directly  reflect  HLL  structure,  the  supply 
of  registers  to  permit  storage  of  most  of  the  active  computation  variables,  and 
provision  of  adequate  data  path  capacity  to  avoid  having  to  generate  additional 
microinstructions  to  resolve  data  path  conflicts."  The  VAX  11/780  addresses  these 
issues  to  a  much  better  degree  than  did  the  POP  11/45.  As  a  result  the  run  time 
performance  of  HLL  compiler  generated  machine  language  and  microcode  approach  the 
performance  obtainable  from  hand  coded  microcode.  Based  upon  these  observations, 
more  study  of  the  compiler  CPU  interface  should  be  carried  out  to  determine 
more  optimum  designs  for  both  the  hardware  and  the  compilers. 
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CONCLUSION 


The  overall  research  objectives  of  this  contract  were  met  except  where 
uncontrollable  circumstances  intervened.  A  design  has  been  achieved  of  a  PASCAL 
to  VAX  11/780  microcode  compiler  which  produces  efficient  microprograms  and  the 
basic  problems  of  interfacing  this  compiler  to  the  VAX  operating  system  have 
been  solved.  Preliminary  support  documentation  has  been  prepared  to  support  the 
"user"  microprogramming  option  for  the  VAX  system  by  other  users  of  the  BMDATC 
Test  Bed. 

Because  of  the  efforts  to  optimize  the  performance  of  the  PASCAL  to  microcode 
compiler,  a  much  better  understanding  of  the  interface  between  a  HLL  and  the 
microcode  control  of  the  CPU  was  developed.  In  particular  algorithms  to  achieve 
optimum  use  of  the  registers  to  store  run  time  variables  were  of  especial  interest 
and  importance.  The  understanding  gained  of  the  generation  of  microcode  from 
compilers  can  lead  to  significant  improvement  in  productivity  of  microprogrammers 
and  expand  the  capability  for  the  use  of  this  performance  enhancing  facility. 

A  further  outgrowth  of  this  research  effort  is  the  building  of  a  broad  base 
for  future  research  into  the  techniques  of  enhancing  computer  performance  thru 
microprogranming.  Of  particular  significance  is  the  development  of  host  machine 
register  assignment  routines  and  the  invoking  programs  required  to  pass  control 
to  and  from  the  "user"  microprogram.  With  these  and  other  accomplishments,  the 
foundation  has  been  laid  for  a  broad  based  research  effort  to  develop  and  perfect 
support  tools  for  the  microprogramming  activity. 

While  much  has  been  accomplished,  future  research  is  required  to  develop 
techniques  for  enhancing  the  flexibility  of  microcode  compilers  by  demonstrating 
that  they  can  be  converted  at  the  input  HLL  interface  to  accept  new  computer 
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languages  and  at  the  host  machine  interface  to  demonstrate  they  can  produce 
microcode  for  new  host  machines.  This  presents  a  severe  challenge  to  researchers 
in  this  field  and  much  fundamental  work  remains  to  be  accomplished.  The  payoff 
from  this  research  can  be  the  ability  to  make  better  use  of  microprogramning 
both  to  enhance  the  computational  performance  of  a  given  host  machine  and  make  it 
more  flexible  with  regard  to  implementation  of  special  purpose  computational 
algorithms. 

The  research  activity  described  in  this  final  report  was  participated  in  by 
several  individuals  at  The  George  Washington  University.  In  particular  I  want 
to  acknowledge  the  efforts  of  Robert  Jones  who  did  this  original  layout  of  the 
PASCAL  and  revised  XCOM  compilers.  Mohamoud  Ketabchi  and  Abdol  Chenari  continued 
the  compiler  development  and  were  later  joined  in  this  activity  by  Mr.  Kwang. 

Mr.  Teh-Hsin  Yang  carried  out  the  compiler  optimization  and  evaluation  effort. 
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Appendix  A 

A1  Quadruple  To  VAX  11/780  MACRO  Language  Translator 
Showing  Documentation  in  Terms  of  Comments  for  Each 
Procedure 

A2  Quadruple  To  VAX  11/780  Microcode  Translator  with 
Example  of  Microcode  Derived  from  Quadruples  for 
Fibonacci  Test  Case. 
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Appendix  A2 

Quadruple  To  VAX  11/780 
Microcode  Translator 
with  Example 
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"USER"  MICROPROGRAMMING  MANUAL  FOR  THE  DEC  VAX  11/780 
Teh-Hsin  Yang 
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2)  MICR02  -  Microprogramming  Assembler 

3)  How  to  load  and  execute  a  microprogram. 

4)  Using  Control  Command  to  debug  the  User  Microprogram. 
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1)  Sasic  Architecture  Features  of  VAX  11/780 

The  VAX  11/780  is  a  general  purpose  microprogrammed  computer.  It 
features  a  32  bit  word,  virtual  sotrage  which  provides  addressing  of2B 
Bytes,  an  8K  cache  storage  unit,  a  200  NS  basic  CPU  cycle,  and  a  powerful 
instruction  set.  This  system  includes  1024  words(96  bits)  of  "user" 
microprogratranable  control  store.  Support  software  for  microprogramming 
includes  an  assembler  and  debugger  which  permits  the  user  to  design  their 
own  microcode.  It  has  been  demonstrated  that  an  algorithm  expressed  dir¬ 
ectly  in  microcode  will  execute  faster  than  routines  programmed  in  assembly 
or  a  high  level  language  (HLL).  Before  describing  this  procedure  for  pre¬ 
paring  microprograms  for  the  VAX  11/780,  a  brief  overview  of  the  architecture 
of  the  CPU  will  be  given.  Special  attention  will  be  given  to  those  parts 
which  most  profoundly  affect  the  preparation  of  microprograms.  Figure  Dl, 
which  is  a  block  diagram  of  the  VAX  11/780  CPU,  will  support  the  ensuing 
discussion. 

The  VAX  11/780  CPU  has  two  sets  of  sixteen  32  bit  general  purpose 
registers.  A  number  of  these  registers  have  preassigned  functions  (Rj2 
=  Argument  list  pointer,  R13  =  Stack  frame  pointer,  R14  *  Stack  pointer, 

Rl5  =  PC.  Registers  R0  thru  R7  are  available  to  programmers.  The  output 
of  the  A  registers  passes  through  latch  LA  to  the  A  side  of  the  ALU  while 
the  output  of  the  B  registers  passes  .through  latch  LB  to  the  B  side  of  the 
ALU.  An  additional  set  of  16  working  32  Bit  Registers,  RC,  output  thru  an  LC 

latch  to  the  left  side  of  the  ALU.  The  RC  reaisters  are  onlv  available  at  the 
microcode  level. 

Two  other  32  bit  registers  are  named  D  and  Q.  They  can  be  gated  to 
either  side  of  the  ALU.  The  content  of  D  register  can  be  delivered  to  the 
Q  Register  directly  without  passing  through  the  ALU.  A  rotation  unit 
accepts  the  64  bits  from  the  Q  and  D  registers  and  rotates  it  by  the  amount 
specified  by  a  user  instruction  and  deposits  32  bits  of  the  shifted  result 
back  in  the  D  register.  The  D  register  also  acts  as  the  memory  data  register 
and  data  read  from  and  into  storage  must  pass  through  this  reqister. 

The  memory  address  register,  VA,  can  be  loaded  from  ALU  or  can  be 
incremented  by  4  which  advances  the  address  by  one  word  (32  bits).  There 
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is  an  auxiliary  10-bit  exponent  ALU  (EALU)  for  handling  floating  point 
exponent  computations.  Its  data  path  contains  a  register  named  SC  which 
is  used  for  shift  operations  and  masking  the  data  entering  the  left  side 
of  the  ALU.  Another  feature  of  the  VAX  CPU  is  the  constant  generator.  A 
ROM,  called  SK,  provides  64  16  bit  constants  available  to  the  left  side 
of  ALU. 

The  VAX  machine  is  controlled  by  a  horizontal  micro  control  word  which 
is  96  bits  wide  and  divided  into  30  control  fields.  This  represents  a 
horizontal  control  word  format  and  presents  a  complicated  microprogram 
design  problem.  Many  operations  may  be  carried  out  in  parallel.  A  detailed 
description  of  the  micro  control  word  is  not  given  here  since  it  is  des¬ 
cribed  in  other  manuals.  But  it  will  be  useful  to  give  a  brief  overview  of 
branch  micro-operations  for  each  control  word.  Thirteen  bits  are  used  to 
form  the  address  of  next  micro  instruction.  These  can  be  assigned  directly 
by  the  microprogram  which  corresponds  to  an  unconditional  branch  at  the 
microcode  level.  When  a  conditional  branch  is  required,  other  micro  control 
words  are  involved  in  establishing  the  branch  conditions  required.  The  BEN 
(Branch  Enable)  micro  control  field  contains  5  bits  which  are  used  to  specify 
twenty-six  branch  conditions.  From  this  control  field  it  is  possible  to 
select  different  program  status  words  (PSWs),  including  the  N,  Z,  V,  C  and 
other  flags  as  Branch  conditions.  These  conditions  are  ORed  with  the  low- 
order  bits  of  the  micro  address  field  (JMP)  to  form  the  address  of  the 
successor  microword.  In  order  to  make  a  conditional  branch  occur,  it  is 
necessary  to  make  sure  the  potential  branch  address  has  certain  bits 
equal  to  zero  so  that  the  OR  operation  produces  the  appropriate  branch 
address.  The  Micro-2  Assembler,  to  be  described  later,  has  the  ability  to 
make  appropriate  bits  of  a  branch  address  equal  to  zero  to  satisfy  the  OR 
requirements. 


The  process  of  writing,  loading,  and  executing  a  microprogram  in 
the  VAX  11/780  "User"  control  storage  area  is  described  here.  Figure 
02  is  a  Flowchart  which  shows  these  steps.  The  first  step  in  micropro¬ 
gramming  the  VAX  11/780  is  generation  of  the  microcode  source  load  module. 
To  do  this,  use  is  made  of  the  VAX  SOS  Editor  system  which  is  initiated 
thru  use  of  the  edit  command.  The  individual  microprogram  statements  are 
entered  along  with  conments  as  shown  by  example  in  Figure  D3.  While  not 
required,  certain  formating  procedures  are  recommended  to  improve  read¬ 
ability  of  this  microcode.  The  formating  procedures  are: 


MICROCODE  GENERATION  PR0T0CALS 


1.  Precede  Micro  instruction  by  any  general  comments 


2.  If  Micro  instruction  has  explicit  address— give  that  address  at  the 
left  margin— put  no  info  on  that  line. 


3.  If  Micro  instruction  has  a  label— give  label  at  left  margin  and  put  no 
other  information  on  that  line. 


4.  Include  as  many  Micro  instruction— parts  separated  by  commas  as  will 
fit  in  columns  starting  at  the  second  tab  (col  17)  and  going  to 
column  38. 


5.  Place  line  specific  comments  starting  at  5th  tab  Col  41. 


BUBBLE. HICi8 

•  TITLE  '  tUE'BLE  SORT* 

•TOC  "ATTACHED  MACRO  DEFINITIONS* 
ALU.Q+LB-l  •RAMX/Q.AMX/RAMX.BMX/LB.ALU/A+B* 

•  REGION  / 1400 » 17FF 

INPUTS 

ASSUMPTION 

Rl-I 

R2-J 

R3-K 

R4-TEMP0RARY  STORAGE 
R6-N 

R7-START  ADDRESS  OF  ARRAY 


r - - - - - - —  — - - —  f 

Q-RCR73  i  ARRAY  START  ADDRESS-Q 

j - 1 

START:  ALU_LA-KC.13» 

LA_RACR33  i  K-l  TO  K 

i - ( 

RCR33-ALU  I 

! - 1 

RCR13-ALU. 

ALU-KC.13  <  1=1 

t - 1 

loop:  d_la. 

LA_RACR33  -  i  K--D 


ALU_D-LB» 

LAB_RCR13.CLK.UBCC  (K-I  IN  ALU  . ; 

I - — - i 

ALU. NT  '  »  K>I? 

i - 1 

=0111  »K>I 

D-ALU.LEFT2. 

ALU-RCR13  !  1*4  TO  D  _ 

VA_ALU» 

ALU_QC+3D» J/PP  !GET  ARRAY <I>  ADDRES 

| - 1 

i  K<I 

ALU-LA-KC . 43 . 

LA_RACR33 . CLK . UBCC I K-0 

I - 1 

•  C31T  »  K>OT  -  . 

■0  I  C31=0  K>0 

J/START  i  BACK  TO  START 

I  C31=l  K<»0 

PC_PC+1 . CLR. IB.OPC. J/062  »  STOP  ■ 

I - 1 

pp:  DCL0NG3_CACHE  I  READ  FROM  MEMORY 

I - - » 

RCR43-DI  .  ....  ......  ...'I- 

LAB.RCR43  I  ARRAY (I)  TO  LA 

I - 1 

VA_VA+4.LAB_RCR43>  GET  ARRAYU+l)  ADDR 


Figure  3-  Microcode  Source 

Program  for  MICRO  2  Assemble! 


0CL0NG3-CACHE  »  READ  ARRAY: 1*1)  TO  D 
I - 1 

- - ALU_LA-Dj CLK. UBCC  -I.  ARRAY: I) -ARRAY CIL1X - - 

I - - 1 

LAB-RCR1 3 r ALU • NT  1 1  TO  LB 

•4111  »  ALU . N*0 r ARRAY  < I ) >ARRAY < 1*1 ) 


I ALU . N= 1 . ARRA Y < I + 1 ) <= ARR A Y < I > 

UA_ALU. 

ALU-Q+LB  I ARRAY: 1*1 >  ADDRESS  TO  VA 

| - - 

CACHE-DCL0NG3. 

ALU^Q*LB-1  (WRITE  ( ARRAY ( I > )  TO  ARRAYCI+1 > 

S  VA.ALU  ( ARRAYC I>  ADDRESS  TO  VA 

D.RCR43  ( ARRAY! 1*1)  TO  D 

I - - 

CACHE-DCL0NG3  • 

j/loop  iwrite  :arrat:i*i)>  to  array: I) 

(END  BUBBLE  SORT 


i 

i 


I 
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Initially  a  source  file  name  followed  by  ,MIC  is  entered  which 
notifies  the  VAX/VMS  operating  system  that  the  editor  is  creating  source 
microcode  modules.  By  using  the  command  $MICR0  2,  "file  name",  the  op¬ 
erating  system  is  notified  that  the  MICRO  2  assembler  is  to  be  used  to 
create  a  microcode  load  module  which  will  be  identified  as  ,MCR  and  will 
be  file  ,MCR;  N  where  N  is  the  number  of  Micro  Assemblies  executed.  The 
MICRO  2  language  lets  the  microprogrammers  express  the  actions 

of  a  microprogram  symbolically.  The  MICRO 
2  assembler  translates  this  symbolic  representation  into  micro  control 
words.  MICRO  2  also  allocates  control  storage  space  to  any  microwords  for 
which  it  isn't  specifically  identified.  In  allocating  control  storage 
address,  the  microprogrammer  has  three  choices: 

--Allocating  control  storage  location  for  a  microword  absolutely. 

—Specifying  a  constraint  on  its  control  stoarge  allocation  of  a  microword 
e.g.  making  the  lowest  order  bits  of  the  address  equal  zero. 

—Letting  MICRO  2  determine  the  microword  allocation. 

MICRO  2  detects  syntax  errors  in  the  source  microprogram.  The  syntax 
checking  is  based  on  two  parts: 

—First,  it  checks  whether  the  statements  are  recognizable  by  MICRO  2. 
Unrecognizable  statements  create  an  error  message. 

--Second,  a  check  of  data  path  conflicts  is  made.  If  the  data  paths  overlap 
for  a  particular  microinstruction,  an  error  message  is  generated. 

The  MICRO  2  source  language  also  contains  some  "pseudo-operations" 
which  don't  generate  microcode  but  give  commands  to  the  assembler.  This  is 
done  by  either  specifying  a  co^snand  to  the  assembler,  or  by  influencing  the 
output  of  the  assembler.  In  order  to  simplify  the  writing  of  a  microprogram, 
the  VAX  11/780  "user"  microprogram  support  offers  a  group  of  MACRO  definitions 
expressed  in  a  language  recognized  by  MICRO  2.  These  MACRO  definitions  are 
oresented  in  the  VAX  11/780  microprogramming  tools  user's  guide. 

'*ey  are  categorized  as  described  below: 

.ALU_0  .  .  .  thru  ALU_D.  AND  .  .  . 

H,  0(8).  .  thru  ALU  D.XOR 
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;ALU_K  .  .  .  thru  ALU_PC  .  .  . 

; ALU  Q  .  .  .  thru  CACHE  .  .  . 

;D_0  ...  thru  D_CACHE 

;D_DAL  .  .  .  thru  D_D  .  .  . 

;D_INT.SUM  .  .  .thru  D_PC  .  .  . 

;D_Q  ... 

; D _ R  .  .  .  thru  D  &  VA_  .  .  . 

;EALU_  .  .  .  thru  FE_  ... 

;ID_  .  .  .  thru  LC_.  .  . 

;PC_  .  .  .  thru  PC  &  VA_  ... 

;Q_0  .  .  .  thru  Q_D  .  .  . 

;  Q _ I B  ...  thru  Q_PC 

;Q_Q  .  .  .  thru  Q  &  VA_.  .  . 

;R[  ]_0  .  .  ..thru  R[  ]  _PACK.FP  > 

;RC [  ]  _0  .  thru  RC  [  ]  _D  .  .  . 

;SC_0(A) 

;SD_  .  .  .  thru  VA_  ... 

;  MATCH  States 

;Non  transfer  functions 

•.Branch  enable  Macro  definitions 

If  the  microprogrammer  doesn't  find. a  MACRO  which  satisfies  a  requirement, 
two  steps  can  be  taken.  One  is  to  specify  the  data  path  and  microoperations 
directly  which  is  time  consuming  and  difficult.-  The  second  is  to  define  a  - 
new  Macro  which  accomplishes  the  desired  CPU  action. 


3)  How  to  load  and  execute  a  microprogram. 

Upon  completion  of  the  generation  of  a  source  microprogram  in  the 
MICRO  2  assembly  language  including  corrections  to  all  errors  detected, 
the  next  step  is  to  load  the  microprogram  into  "user"  control  storage. 

This  is  accomplished  by  first  assembling  the  source  microprogram  by  issuing 
the  command: 

$MICR0  2  file  name  and  depressing  the  return  key. 

The  MICRO  2  assembler  assembles  the  source  program  and  produces  a  listing 
file  (.MCR).  To  obtain  a  copy  of  the  assembled  program,  issue  the  command: 

$TYPE  or  PRINT  file  name  .MCR  and  depress  the  return  key. 

Depending  on  the  type  of  terminal  being  used  the  response  will  be  a  listing 
either  on  the  CRT  display,  a  keyboard  printer  (TYPE)  or  on  a  line  printer 
(PRINT).  The  listing  shows  the  source  program  and  the  corresponding  micro¬ 
code.  It  also  indicates  syntax  errors  if  any  exist.  When  all  errors  are 
corrected,  use  the  command: 

$MICLD 

The  assembled  program  is  loaded  into  writable  control  storage  (WCS).  The 
accomplishment  of  this  load  is  specified  at  a  CRT  terminal  by  the  appearance 
of  the  prompt  symbol  indicating  that  the  VMS  control  program  is  awaiting 
the  next  command.  In  order  to  execute  the  assembled  microprogram  it  is 
necessary  to  transfer  control  of  the  VAX  11/780  CPU  over  to  the  microcode 
now  resident  in  the  WCS.  This  requires  a  command  to  the  Virtual  Machine 
System  (VMS).  To  do  this  an  invoking  program  written  in  VAX  MACRO  assembler 
language  is  required.  The  VAX  11/780  MACRO  assembly  language  has  an  ex¬ 
tended  function  call,  (XFC),  instruction  which  transfers  control  from  VMS 
to  the  microcode  in  WCS.  The  invoking  program  must  also  set  up  any  para¬ 
meters  to  be  passed  to  the  microcode  either  by  inserting  values  in  registers 
or  through  a  parameter  list  in  main  memory.  A  pointer  to  this  parameter 
list  must  be  stored  in  a  register.  Also  to  be  able  to  check  the  result  of 
running  a  microprogram  a  labeled  NOP  instruction  serves  as  a  breakpoint  which 
can  be  used  to  check  the  result  of  executing  microcode  by  using  the  MACRO 
Debug  facility.  The  sample  invoking  program  which  in  addition  to  executing 
the  microcode  in  WCS  also  measures  microcode  execution  time  is  shown  in 
figure  D5. 
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TYPE  SORTTEST . MAR 

.TITLE 

SORTTEST 

.ENTRY 

SORTTEST,  "MO 

♦  CMNRNI _ S 

ROUTIN= 

SAVEC  f  SAVE  AND  CHANGE  XFC  VECTOR 

♦CREATE 

FAB=OUTFAB 

♦CONNECT 

RAB=OUTRAB 

♦ASCTIM.S 

,  TIMBUF 

=ATIMENOUr» 

H0VC3 

*32, ATIMENOU, TIMOUT 

♦PUT 

RAB=OUTRAB"  * 

'  _• 

MOVAL 

AR.RO  JSTARTING  ADDR  OF  ARRAY  - 

J  „ - - - 

-i  •  ' 

xrc 

♦  CMKRNI _ S 

ROUTINE 

rsvec 

*EXIT_S 

♦ASCTIM_S 

.TIMBUF 

=ATIMENOU,»  - 

M0VC3 

♦32, ATIMENOU .TIMOUT  • 

♦PUT 

RAB=OUTRAB  --  ",  V .  ' 

♦CLOSE 

FAB=OUTFAB  -  .-,-t  .  *  . 

f - - - 

-I 

*  ,  .  J 

savec: 

.WORD 

o  ■ 

. . 

MOVL 

SCB*AL_BASE+20» RIVEC  ->-i- 

MOVL 

*2  f  SCB* AL—BASE+ 20 •  r 

RET 

^  ‘  ;  .4  /  '*  . 

y - - - 

-1 

RSVECI 

.WORD 

o  ••  -  •  :■  :*■  '••  •-•>. 7  •••  *.i* 

i  • 

MOVL 

RIVEC.  SCB*AL_BASE+20 

RET-  - 

-  .--r:,  -'iv’ 

-I 

■***•’-  —•  '* \  '  .  ^  •. 

.psbct 

VECTOR, LONG  r  -X  -  •-  J 

rivec: 

.LONG 

0  •  i  f  ...  (  v.. 

j. 

.LONG 

0  v  .**  V. 

» - t - 

— 

i  * 

outfab: 

♦FAB 

FNM“TIMBF»RFM=FIX»MRS=TIMSIZ,RAT=CR 

outrab: 

*RAB 

FAB=OUTFAB . RBF-T IMOUT . RSZ-T IMSIZ 

timout: 

.  BLKB 

32 

TIMSIZ* 

.-TIMOUT 

atimenou: 

.LONG 

20*-10$ 

.LONG  10* 

io»: 

.BLKB 

32 

20*: 

.BLKB 

0 

ar: 

.BLKL 

25S 

* 

.END 

SORTTEST 

-  :1- 


Figure  5  Microcode  Invoking  Program  in  VAX  11/780. 
MACRO  Language 
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The  steps  to  use  the  VMS  commands  to  execute  the  MACRO  invoking 
program  assuming  the  microprogram  is  loaded  in  WCS  are: 

$  MACRO  invoking  file  name 

$  LINK  invoking  file  name  +  SYS  $  System :SYS.STB/SEL 
$  RUN  invoking  file  name 

Execution  begins  under  VMS  of  the  MACRO  invoking  program  and  when  the 
XFC  instruction  is  encountered,  the  microcode  is  executed.  Control  is 
returned  to  VMS  and  the  balance  of  the  assembly  language  program  is  ex¬ 
ecuted  . 
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4)  Control  Command  to  Debug  the  "User11  Microprogram. 

It  is  Very  important  to  have  a  debug  facility  to  check  the  MICRO 
Code  when  it  is  running  in  WCS.  MICRO  2  includes  commands  that  can  cause 
the  microprogram  to  be  executed  step  by  step  or  to  execute  up  to  any 
microinstruction  in  the  microprograms  in  WCS.  A  list  of  user's  commands 
and  terminal  reaction  is  shown  below: 

TERMINAL  COMMENTS 

USER  COMMAND  INFORMATION  _ _ 


Prompt  Command 


$ 

MACRO  File  Name 

Nothing,  if  nothing 
is  wrong 

Assemble  the  invoking 
program 

$ 

LINK  File  Name 

Nothing,  if  nothing 

Link  the  invoking 

SYSTEM: SYS. 

STB/SEL 

is  wrong 

program 

$ 

MICRO  2  File  Name 

Nothing,  if  nothing 
is  wrong 

Assemble  the  microcode 

$ 

MICLD  File  Name 

WCS  Load  Complete 

Loads  microcode  into 

WCS 

$ 

CTRL 

Show  >» 

Turn  to  microprogram 

$ 

RUN  File  Name 

>»  WCS 

Start  Execution  of 
invoking  Program  & 
invokes  WCS  Debugger 

>» 

SE  SOMM 

Show  »> 

Set-Stop  on  Micro  Match 
SOMM 

»> 

H 

Halted  at  XXX 

Halt 

>» 

C 

Nothing 

Continue 

WCS  > 

C 

Message  as  appropriate 
when  execution  stops 

Continue  to  execute 
next  instruction 

WCS  > 

E  RA  [  ] 

Show  content  of  register 

RA  (HEX) 

Exam  the  :ontent  of 

RA  [  ] 

WCS  > 

E  DR 

Show  content  of  D  register 

WCS  > 

RET 

»>  Information  on  CPU  status 

Return  to  MACRO  Control 

»> 

Show 

Shows  CPU  status 

»> 

E/ID  21 

E:  Examine 

ID:  Internal  Register 
21:  SOMM  Register 
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QUADRUPLE  DEFINITIONS 

The  following  quadruple  definitions  are  to  be  considered  standard  for  the 
interface  between  the  PASCAL  and  XPL  to  quadruple  compilers  and  the  Quadruple  to 
VAX  MACRO  (Assembly  Language)  and  MICRO  2  (Microcode)  translators. 

Two  formats  are  described  below.  One  is  for  the  complete  quadruple  while  the 
second  is  for  operands  1,  2  and  3: 


Quadruple  Format: 

Operation 

Data 

Type 

Operand  1 

Operand  2 

Operand  3 

14  17 

22 

25  35 

CJ 

CD 

-F» 

CD 

51  61 

xxxxxxxxx  I 

R 
b 


Operand  Format 


Operand  1 

25 

26 

27 

35 

Operand  2 

38 

39 

40 

48 

Operand  3 

51 

52 

53 

61 

# 

U 

N 

N 

0  T 
N 


R 

I 

# 

<a 

$ 

T 

U 

N 


Operands  are  real  values 
Operands  are  integer  values 
Operand  is  a  numeric  value 
Operand  is  an  address  of  a  data  item 

Operand  is  temporary  location  in  storage  on  a  register 
Operand  is  a  storage  address  assigned  to  a  data  item 
Numeric  value 
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specified  in  operand  2  and  place 
the  shifted  resultant  value  in 
the  storage  or  temporary  storage 
location  specified  in  operand  3. 
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UNST  Indicates  that  certain  variables 

associated  with  this  procedure 
should  be  removed  from  the 
compiler  stacks. 


QUADRUPLES  TO  MICRO  CODE  SPECIFICATION 

ASSUMPTIONS : 

1)  The  constant  which  does  not  exist  in  Fk,  Sk  would  be  stored  in  Reg  C 

2)  (T)  Micro  Routine  if  constant  is  in  Sk 

( 2 )  Micro  Routine  if  constant  is  not  in  Sk 

3)  #-Numerical  Value 

4)  S-  Storage  Address 

5)  T- Temporary  Storage  Address 

6)  S(R)  -  Storage  Address  in  Register 

7)  RA  [  ]  -  A  Register  value 

RB  [  ]  -  B  Register  value 

RC  [  ]  -  C  Register  value 

8)  D  [  ]  -  D  Register  value 

Q  [  ]  -  Q  Register  value 

9)  @R  -Indirect  address  of  variable  in  storage  stored  in  a  register 

@T  -Indirect  address  of  variable  in  storage  stored  in  a  temporary  storage  locatio 

10)  ADDR  -  Branch  Address 
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QUAD 

OPERATOR 


QUAD 

OPERAND 


MICRO  2 


D-2 


QUAD 

OPERAND 

MICRO  2 

©Q_RC[  ]; 

ALU  LA  Q, 

LAB  R[  ], 

CLK.UBCC; 

ALU  ?  ; 

=  1110 

J/ADDR; 

Si  ,  S2  ,  T 

ADDR, 

Q  R[S2];  " 

ALU  LA  Q, 

LAB  R[Si], 

CLK.UBCC; 

ALU  ?  ; 

=  1110 

J/ADDR; 

it  ,  S(R),  S(R) 

QLAB  R[S], 

ALU  LA  +  K[  ] , 

R[S]_ALU; 

(Dq  RC[  ]; 

LAB  R[S] ; 

ALU  LA  +  Q, 

R[S]_ALU; 

Si(R),  S2(R),  S3 

VA  RC[  ] , 

LAB  R[SiJ; 

(LA  R[S2], 

(ALU  LAf+jLB, 

D_ALU; 

Cache_D  [Long] ; 

S(R),  If  ,  S(R) 

©LAB  R[S] ; 

ALU  LA  K[  ] ; 

R[S]_ALU; 

(Dq  RC[  ]} 

LAB  R[SJ; 

ALU  LA  Q, 

R[S]_ALU; 

sl(r) t  S2CR)>S3 

VA  RC[  ]  f 

LAB  R[S2] ; 

(LA  R[SiJ  , 

(ALU  LaI-]LB, 

(  D_ALU ; 

Cache_D[Long] ; 

1 


PC_PC  +  1,  CLR.IB.OPC 
J/062; 


